
(1) GENERAL INFORMATION: 


(i) APPLICANT: 

(A) NAME: Akzo Nobel N.V.

(B) STREET: Velperweg 76 

(C) CITY: Arnhem 

(E) COUNTRY: The Netherlands

(F) POSTAL CODE (ZIP) : 6824 BM 

(G) TELEPHONE: 0412-666379 

(H) TELEFAX: 0412-650592 

(I) TELEX: 37503 akpha nl 


(ii) TITLE OF INVENTION: Novel estrogen receptor 
(iii) NUMBER OF SEQUENCES: 28 

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)


(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1434 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60

ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120

CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180

GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 240

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300

GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480

ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840

TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900

CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020

CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200

CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGTAA CAAGGGCATG 1260

GAACATCTGC TCAACATGAA GTGCAAAAAT GTGGTCCCAG TGTATGACCT GCTGCTGGAG 1320

ATGCTGAATG CCCACGTGCT TCGCGGGTGC AAGTCCTCCA TCACGGGGTC CGAGTGCAGC 1380

CCGGCAGAGG ACAGTAAAAG CAAAGAGGGC TCCCAGAACC CACAGTCTCA GTGA 1434



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60

ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120

CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180

GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 240

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300

GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480

ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840

TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900

CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCCAA 1020

CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200

CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGGTG A 1251

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 66 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp
1 5 10 15

Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser Ile Gln Gly His
20 25 30

Asn Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr Ile Asp Lys Asn
35 40 45

Arg Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val
50 55 60

Gly Met
65

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Leu Val Leu Thr Leu Leu Glu Ala Glu Pro Pro His Val Leu Ile Ser
1 5 10 15

Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr
20 25 30

Lys Leu Ala Asp Lys Glu Leu Val His Met Ile Ser Trp Ala Lys Lys
35 40 45

Ile Pro Gly Phe Val Glu Leu Ser Leu Phe Asp Gln Val Arg Leu Leu
50 55 60

Glu Ser Cys Trp Met Glu Val Leu Met Met Gly Leu Met Trp Arg Ser
65 70 75 80

Ile Asp His Pro Gly Lys Leu Ile Phe Ala Pro Asp Leu Val Leu Asp
85 90 95

Arg Asp Glu Gly Lys Cys Val Glu Gly Ile Leu Glu Ile Phe Asp Met
100 105 110

Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu Leu Lys Leu Gln His Lys
115 120 125

Glu Tyr Leu Cys Val Lys Ala Met Ile Leu Leu Asn Ser Ser Met Tyr
130 135 140

Pro Leu Val Thr Ala Thr Gln Asp Ala Asp Ser Ser Arg Lys Leu Ala
145 150 155 160

His Leu Leu Asn Ala Val Thr Asp Ala Leu Val Trp Val Ile Ala Lys
165 170 175

Ser Gly Ile Ser Ser Gln Gln Gln Ser Met Arg Leu Ala Asn Leu Leu
180 185 190

Met Leu Leu Ser His Val Arg His Ala Ser Asn Lys Gly Met Glu His
195 200 205

Leu Leu Asn Met Lys Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu
210 215 220

Leu Glu Met Leu Asn Ala His Val Leu
225 230



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 477 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Asn Tyr Ser Ile Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro
1 5 10 15

Gly Arg Gln Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His
20 25 30

Leu Ser Pro Leu Val Val His Arg Gln Leu Ser His Leu Tyr Ala Glu
35 40 45

Pro Gln Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu
50 55 60

Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys
65 70 75 80

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys
85 90 95

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser
100 105 110

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser Ile Gln Gly His Asn
115 120 125

Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr Ile Asp Lys Asn Arg
130 135 140

Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly
145 150 155 160

Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val
165 170 175

Arg Arg Gln Arg Ser Ala Asp Glu Gln Leu His Cys Ala Gly Lys Ala
180 185 190

Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp
195 200 205

Ala Leu Ser Pro Glu Gln Leu Val Leu Thr Leu Leu Glu Ala Glu Pro
210 215 220

Pro His Val Leu Ile Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser
225 230 235 240

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met
245 250 255

Ile Ser Trp Ala Lys Lys Ile Pro Gly Phe Val Glu Leu Ser Leu Phe
260 265 270

Asp Gln Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met
275 280 285

Gly Leu Met Trp Arg Ser Ile Asp His Pro Gly Lys Leu Ile Phe Ala
290 295 300

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly Ile
305 310 315 320

Leu Glu Ile Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu
325 330 335

Leu Lys Leu Gln His Lys Glu Tyr Leu Cys Val Lys Ala Met Ile Leu
340 345 350

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gln Asp Ala Asp
355 360 365

Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu
370 375 380

Val Trp Val Ile Ala Lys Ser Gly Ile Ser Ser Gln Gln Gln Ser Met
385 390 395 400

Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Ser
405 410 415

Asn Lys Gly Met Glu His Leu Leu Asn Met Lys Cys Lys Asn Val Val
420 425 430

Pro Val Tyr Asp Leu Leu Leu Glu Met Leu Asn Ala His Val Leu Arg
435 440 445

Gly Cys Lys Ser Ser Ile Thr Gly Ser Glu Cys Ser Pro Ala Glu Asp
450 455 460

Ser Lys Ser Lys Glu Gly Ser Gln Asn Pro Gln Ser Gln
465 470 475

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 416 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Asn Tyr Ser Ile Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro
1 5 10 15

Gly Arg Gln Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His
20 25 30

Leu Ser Pro Leu Val Val His Arg Gln Leu Ser His Leu Tyr Ala Glu
35 40 45

Pro Gln Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu
50 55 60

Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys
65 70 75 80

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys
85 90 95

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser
100 105 110

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser Ile Gln Gly His Asn
115 120 125

Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr Ile Asp Lys Asn Arg
130 135 140

Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly
145 150 155 160

Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val
165 170 175

Arg Arg Gln Arg Ser Ala Asp Glu Gln Leu His Cys Ala Gly Lys Ala
180 185 190

Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp
195 200 205

Ala Leu Ser Pro Glu Gln Leu Val Leu Thr Leu Leu Glu Ala Glu Pro
210 215 220

Pro His Val Leu Ile Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser
225 230 235 240

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met
245 250 255

Ile Ser Trp Ala Lys Lys Ile Pro Gly Phe Val Glu Leu Ser Leu Phe
260 265 270

Asp Gln Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met
275 280 285

Gly Leu Met Trp Arg Ser Ile Asp His Pro Gly Lys Leu Ile Phe Ala
290 295 300

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly Ile
305 310 315 320

Leu Glu Ile Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu
325 330 335

Leu Lys Leu Gln His Lys Glu Tyr Leu Cys Val Lys Ala Met Ile Leu
340 345 350

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gln Asp Ala Asp
355 360 365

Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu
370 375 380

Val Trp Val Ile Ala Lys Ser Gly Ile Ser Ser Gln Gln Gln Ser Met
385 390 395 400

Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Arg
405 410 415



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GGIGAYGARG CWTCIGGITG YCAYTAYGG 29

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

AAGCCTGGSA YICKYTTIGC CCAIYTIAT 29

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

TGTTACGAAG TGGGAATGGT GA 22

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

TTGACACCAG ACCAACTGGT AATG 24

(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GGTGGCGACG ACTCCTGGAG CCCG 24

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

GTACACTGAT TTGTAGCTGG AC 22 



(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

CCATGATGAT GTCCCTGACC 20

(2) INFORMATION FOR SEQ ID NO: 14:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

TCGCATGCCT GACGTGGGAC 20

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GGCSTCCAGC ATCTCCAGSA RCAG 24



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

GGAAGCTGGC TCACTTGCTG 20

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

TCTTGTTCTG GACAGGGATG 20

(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

GCATGGAACA TCTGCTCAAC 20



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

AGCAAGTTCA GCCTGTTAAG T 21

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATGAATTACA GCATTCCCAG CAATGTCACT AACTTGGAAG GTGGGCCTGG TCGGCAGACC 60

ACAAGCCCAA ATGTGTTGTG GCCAACACCT GGGCACCTTT CTCCTTTAGT GGTCCATCGC 120

CAGTTATCAC ATCTGTATGC GGAACCTCAA AAGAGTCCCT GGTGTGAAGC AAGATCGCTA 180

GAACACACCT TACCTGTAAA CAGAGAGACA CTGAAAAGGA AGGTTAGTGG GAACCGTTGC 240

GCCAGCCCTG TTACTGGTCC AGGTTCAAAG AGGGATGCTC ACTTCTGCGC TGTCTGCAGC 300

GATTACGCAT CGGGATATCA CTATGGAGTC TGGTCGTGTG AAGGATGTAA GGCCTTTTTT 360

AAAAGAAGCA TTCAAGGACA TAATGATTAT ATTTGTCCAG CTACAAATCA GTGTACAATC 420

GATAAAAACC GGCGCAAGAG CTGCCAGGCC TGCCGACTTC GGAAGTGTTA CGAAGTGGGA 480

ATGGTGAAGT GTGGCTCCCG GAGAGAGAGA TGTGGGTACC GCCTTGTGCG GAGACAGAGA 540

AGTGCCGACG AGCAGCTGCA CTGTGCCGGC AAGGCCAAGA GAAGTGGCGG CCACGCGCCC 600

CGAGTGCGGG AGCTGCTGCT GGACGCCCTG AGCCCCGAGC AGCTAGTGCT CACCCTCCTG 660

GAGGCTGAGC CGCCCCATGT GCTGATCAGC CGCCCCAGTG CGCCCTTCAC CGAGGCCTCC 720

ATGATGATGT CCCTGACCAA GTTGGCCGAC AAGGAGTTGG TACACATGAT CAGCTGGGCC 780

AAGAAGATTC CCGGCTTTGT GGAGCTCAGC CTGTTCGACC AAGTGCGGCT CTTGGAGAGC 840

TGTTGGATGG AGGTGTTAAT GATGGGGCTG ATGTGGCGCT CAATTGACCA CCCCGGCAAG 900

CTCATCTTTG CTCCAGATCT TGTTCTGGAC AGGGATGAGG GGAAATGCGT AGAAGGAATT 960

CTGGAAATCT TTGACATGCT CCTGGCAACT ACTTCAAGGT TTCGAGAGTT AAAACTCGAA 1020

CACAAAGAAT ATCTCTGTGT CAAGGCCATG ATCCTGCTCA ATTCCAGTAT GTACCCTCTG 1080

GTCACAGCGA CCCAGGATGC TGACAGCAGC CGGAAGCTGG CTCACTTGCT GAACGCCGTG 1140

ACCGATGCTT TGGTTTGGGT GATTGCCAAG AGCGGCATCT CCTCCCAGCA GCAATCCATG 1200

CGCCTGGCTA ACCTCCTGAT GCTCCTGTCC CACGTCAGGC ATGCGAGGTC TGCCTGA 1257



(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Met Asn Tyr Ser Ile Pro Ser Asn Val Thr Asn Leu Glu Gly Gly Pro
1 5 10 15

Gly Arg Gln Thr Thr Ser Pro Asn Val Leu Trp Pro Thr Pro Gly His
20 25 30

Leu Ser Pro Leu Val Val His Arg Gln Leu Ser His Leu Tyr Ala Glu
35 40 45

Pro Gln Lys Ser Pro Trp Cys Glu Ala Arg Ser Leu Glu His Thr Leu
50 55 60

Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val Ser Gly Asn Arg Cys
65 70 75 80

Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg Asp Ala His Phe Cys
85 90 95

Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser
100 105 110

Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser Ile Gln Gly His Asn
115 120 125

Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr Ile Asp Lys Asn Arg
130 135 140

Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly
145 150 155 160

Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys Gly Tyr Arg Leu Val
165 170 175

Arg Arg Gln Arg Ser Ala Asp Glu Gln Leu His Cys Ala Gly Lys Ala
180 185 190

Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg Glu Leu Leu Leu Asp
195 200 205

Ala Leu Ser Pro Glu Gln Leu Val Leu Thr Leu Leu Glu Ala Glu Pro
210 215 220

Pro His Val Leu Ile Ser Arg Pro Ser Ala Pro Phe Thr Glu Ala Ser
225 230 235 240

Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys Glu Leu Val His Met
245 250 255

Ile Ser Trp Ala Lys Lys Ile Pro Gly Phe Val Glu Leu Ser Leu Phe
260 265 270

Asp Gln Val Arg Leu Leu Glu Ser Cys Trp Met Glu Val Leu Met Met
275 280 285

Gly Leu Met Trp Arg Ser Ile Asp His Pro Gly Lys Leu Ile Phe Ala
290 295 300

Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys Cys Val Glu Gly Ile
305 310 315 320

Leu Glu Ile Phe Asp Met Leu Leu Ala Thr Thr Ser Arg Phe Arg Glu
325 330 335

Leu Lys Leu Gln His Lys Glu Tyr Leu Cys Val Lys Ala Met Ile Leu
340 345 350

Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala Thr Gln Asp Ala Asp
355 360 365

Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala Val Thr Asp Ala Leu
370 375 380

Val Trp Val Ile Ala Lys Ser Gly Ile Ser Ser Gln Gln Gln Ser Met
385 390 395 400

Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His Val Arg His Ala Arg
405 410 415

Ser Ala



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

CTTGGATCCA TAGCCCTGCT GTGATGAATT ACAG 34

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

GATGGATCCT CACCTCAGGG CCAGGCGTCA CTG 33




(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1898 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

CACGAATCTT TGAGAACATT ATAATGACCT TTGTGCCTCT TCTTGCAAGG TGTTTTCTCA 60

GCTGTTATCT CAAGACATGG ATATAAAAAA CTCACCATCT AGCCTTAATT CTCCTTCCTC 120

CTACAACTGC AGTCAATCCA TCTTACCCCT GGAGCACGGC TCCATATACA TACCTTCCTC 180

CTATGTAGAC AGCCACCATG AATATCCAGC CATGACATTC TATAGCCCTG CTGTGATGAA 240

TTACAGCATT CCCAGCAATG TCACTAACTT GGAAGGTGGG CCTGGTCGGC AGACCACAAG 300

CCCAAATGTG TTGTGGCCAA CACCTGGGCA CCTTTCTCCT TTAGTGGTCC ATCGCCAGTT 360

ATCACATCTG TATGCGGAAC CTCAAAAGAG TCCCTGGTGT GAAGCAAGAT CGCTAGAACA 420

CACCTTACCT GTAAACAGAG AGACACTGAA AAGGAAGGTT AGTGGGAACC GTTGCGCCAG 480

CCCTGTTACT GGTCCAGGTT CAAAGAGGGA TGCTCACTTC TGCGCTGTCT GCAGCGATTA 540

CGCATCGGGA TATCACTATG GAGTCTGGTC GTGTGAAGGA TGTAAGGCCT TTTTTAAAAG 600

AAGCATTCAA GGACATAATG ATTATATTTG TCCAGCTACA AATCAGTGTA CAATCGATAA 660

AAACCGGCGC AAGAGCTGCC AGGCCTGCCG ACTTCGGAAG TGTTACGAAG TGGGAATGGT 720

GAAGTGTGGC TCCCGGAGAG AGAGATGTGG GTACCGCCTT GTGCGGAGAC AGAGAAGTGC 780

CGACGAGCAG CTGCACTGTG CCGGCAAGGC CAAGAGAAGT GGCGGCCACG CGCCCCGAGT 840

GCGGGAGCTG CTGCTGGACG CCCTGAGCCC CGAGCAGCTA GTGCTCACCC TCCTGGAGGC 900

TGAGCCGCCC CATGTGCTGA TCAGCCGCCC CAGTGCGCCC TTCACCGAGG CCTCCATGAT 960

GATGTCCCTG ACCAAGTTGG CCGACAAGGA GTTGGTACAC ATGATCAGCT GGGCCAAGAA 1020

GATTCCCGGC TTTGTGGAGC TCAGCCTGTT CGACCAAGTG CGGCTCTTGG AGAGCTGTTG 1080

GATGGAGGTG TTAATGATGG GGCTGATGTG GCGCTCAATT GACCACCCCG GCAAGCTCAT 1140

CTTTGCTCCA GATCTTGTTC TGGACAGGGA TGAGGGGAAA TGCGTAGAAG GAATTCTGGA 1200

AATCTTTGAC ATGCTCCTGG CAACTACTTC AAGGTTTCGA GAGTTAAAAC TCCAACACAA 1260

AGAATATCTC TGTGTCAAGG CCATGATCCT GCTCAATTCC AGTATGTACC CTCTGGTCAC 1320

AGCGACCCAG GATGCTGACA GCAGCCGGAA GCTGGCTCAC TTGCTGAACG CCGTGACCGA 1380

TGCTTTGGTT TGGGTGATTG CCAAGAGCGG CATCTCCTCC CAGCAGCAAT CCATGCGCCT 1440

GGCTAACCTC CTGATGCTCC TGTCCCACGT CAGGCATGCG AGTAACAAGG GCATGGAACA 1500

TCTGCTCAAC ATGAAGTGCA AAAATGTGGT CCCAGTGTAT GACCTGCTGC TGGAGATGCT 1560

GAATGCCCAC GTGCTTCGCG GGTGCAAGTC CTCCATCACG GGGTCCGAGT GCAGCCCGGC 1620

AGAGGACAGT AAAAGCAAAG AGGGCTCCCA GAACCCACAG TCTCAGTGAC GCCTGGCCCT 1680

GAGGTGAACT GGCCCACAGA GGTCACAAGC TGAAGCGTGA ACTCCAGTGT GTCAGGAGCC 1740

TGGGCTTCAT CTTTCTGCTG TGTGGTCCCT CATTTGGTGA TGGCAGGCTT GGTCATGTAC 1800

CATCCTTCCC TCCACCTTCC CAACTCTCAG GAGTCGGTGT GAGGAAGCCA TAGTTTCCCT 1860

TGTTAGCAGA GGGACATTTG AATCGAGCGT TTCCACAC 1898

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Asp Ile Lys Asn Ser Pro Ser Ser Leu Asn Ser Pro Ser Ser Tyr
1 5 10 15

Asn Cys Ser Gln Ser Ile Leu Pro Leu Glu His Gly Ser Ile Tyr Ile
20 25 30

Pro Ser Ser Tyr Val Asp Ser His His Glu Tyr Pro Ala Met Thr Phe
35 40 45

Tyr Ser Pro Ala Val Met Asn Tyr Ser Ile Pro Ser Asn Val Thr Asn
50 55 60

Leu Glu Gly Gly Pro Gly Arg Gln Thr Thr Ser Pro Asn Val Leu Trp
65 70 75 80

Pro Thr Pro Gly His Leu Ser Pro Leu Val Val His Arg Gln Leu Ser
85 90 95

His Leu Tyr Ala Glu Pro Gln Lys Ser Pro Trp Cys Glu Ala Arg Ser
100 105 110

Leu Glu His Thr Leu Pro Val Asn Arg Glu Thr Leu Lys Arg Lys Val
115 120 125

Ser Gly Asn Arg Cys Ala Ser Pro Val Thr Gly Pro Gly Ser Lys Arg
130 135 140

Asp Ala His Phe Cys Ala Val Cys Ser Asp Tyr Ala Ser Gly Tyr His
145 150 155 160

Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala Phe Phe Lys Arg Ser
165 170 175

Ile Gln Gly His Asn Asp Tyr Ile Cys Pro Ala Thr Asn Gln Cys Thr
180 185 190

Ile Asp Lys Asn Arg Arg Lys Ser Cys Gln Ala Cys Arg Leu Arg Lys
195 200 205

Cys Tyr Glu Val Gly Met Val Lys Cys Gly Ser Arg Arg Glu Arg Cys
210 215 220

Gly Tyr Arg Leu Val Arg Arg Gln Arg Ser Ala Asp Glu Gln Leu His
225 230 235 240

Cys Ala Gly Lys Ala Lys Arg Ser Gly Gly His Ala Pro Arg Val Arg
245 250 255

Glu Leu Leu Leu Asp Ala Leu Ser Pro Glu Gln Leu Val Leu Thr Leu
260 265 270

Leu Glu Ala Glu Pro Pro His Val Leu Ile Ser Arg Pro Ser Ala Pro
275 280 285

Phe Thr Glu Ala Ser Met Met Met Ser Leu Thr Lys Leu Ala Asp Lys
290 295 300

Glu Leu Val His Met Ile Ser Trp Ala Lys Lys Ile Pro Gly Phe Val
305 310 315 320

Glu Leu Ser Leu Phe Asp Gln Val Arg Leu Leu Glu Ser Cys Trp Met
325 330 335

Glu Val Leu Met Met Gly Leu Met Trp Arg Ser Ile Asp His Pro Gly
340 345 350

Lys Leu Ile Phe Ala Pro Asp Leu Val Leu Asp Arg Asp Glu Gly Lys
355 360 365

Cys Val Glu Gly Ile Leu Glu Ile Phe Asp Met Leu Leu Ala Thr Thr
370 375 380

Ser Arg Phe Arg Glu Leu Lys Leu Gln His Lys Glu Tyr Leu Cys Val
385 390 395 400

Lys Ala Met Ile Leu Leu Asn Ser Ser Met Tyr Pro Leu Val Thr Ala
405 410 415

Thr Gln Asp Ala Asp Ser Ser Arg Lys Leu Ala His Leu Leu Asn Ala
420 425 430

Val Thr Asp Ala Leu Val Trp Val Ile Ala Lys Ser Gly Ile Ser Ser
435 440 445

Gln Gln Gln Ser Met Arg Leu Ala Asn Leu Leu Met Leu Leu Ser His
450 455 460

Val Arg His Ala Ser Asn Lys Gly Met Glu His Leu Leu Asn Met Lys
465 470 475 480

Cys Lys Asn Val Val Pro Val Tyr Asp Leu Leu Leu Glu Met Leu Asn
485 490 495

Ala His Val Leu Arg Gly Cys Lys Ser Ser Ile Thr Gly Ser Glu Cys
500 505 510

Ser Pro Ala Glu Asp Ser Lys Ser Lys Glu Gly Ser Gln Asn Pro Gln
515 520 525

Ser Gln
530

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

GTGCGGATCC TCTCAAGACA TGGATATAAA 30

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

AGTAACAGGG CTGGCGCAAC GGTTC 25

(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

ACTGGCGATG GACCACTAAA GG 22



